Discovering Pattern Tableaux for Data Quality Analysis: a Case Study
Authors
Abstract
In this paper, we present a case study that illustrates the utility of pattern tableau discovery for data quality analysis. Given a user-supplied integrity constraint, such as a boolean predicate expected to be satisfied by every tuple, a functional dependency, or an inclusion dependency, a pattern tableau is a concise summary of subsets of the data that satisfy or fail the constraint. We describe Data Auditor, our system for automatic tableau discovery from data, and we give real-life examples of characterizing data quality in a network monitoring database used by a large Internet Service Provider.
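To make the notion of a pattern tableau concrete, the following is a minimal sketch for the simplest constraint type mentioned above, a boolean predicate over tuples. It is not Data Auditor's algorithm: it naively enumerates every pattern of attribute values and wildcards, and keeps those whose matching tuples are numerous enough (support) and satisfy the predicate often enough (confidence). The function names, the thresholds, and the toy `rows` data are all hypothetical and chosen only for illustration.

```python
from itertools import product

WILDCARD = "*"  # matches any value of an attribute


def matches(row, pattern):
    """True if the row agrees with the pattern on every non-wildcard attribute."""
    return all(v == WILDCARD or row[a] == v for a, v in pattern.items())


def candidate_patterns(rows, attrs):
    """Naively enumerate every combination of observed values and wildcards."""
    domains = [sorted({r[a] for r in rows}) + [WILDCARD] for a in attrs]
    for values in product(*domains):
        yield dict(zip(attrs, values))


def tableau(rows, attrs, predicate, min_support, min_confidence):
    """Return (pattern, support, confidence) for patterns meeting both thresholds.

    support    = fraction of all rows matched by the pattern
    confidence = fraction of matched rows that satisfy the predicate
    """
    result = []
    for pattern in candidate_patterns(rows, attrs):
        covered = [r for r in rows if matches(r, pattern)]
        if not covered:
            continue
        support = len(covered) / len(rows)
        confidence = sum(1 for r in covered if predicate(r)) / len(covered)
        if support >= min_support and confidence >= min_confidence:
            result.append((pattern, support, confidence))
    return result


# Hypothetical monitoring records: was each device polled successfully?
rows = [
    {"region": "east", "device": "router", "polled": True},
    {"region": "east", "device": "switch", "polled": True},
    {"region": "west", "device": "router", "polled": False},
    {"region": "west", "device": "switch", "polled": True},
]

hold = tableau(rows, ["region", "device"], lambda r: r["polled"],
               min_support=0.5, min_confidence=1.0)
for pattern, support, confidence in hold:
    print(pattern, support, confidence)
```

On this toy data the sketch reports two patterns, (east, *) and (*, switch), each covering half of the rows with every covered row satisfying the predicate. A tableau of failing subsets could be obtained the same way by negating the predicate. Exhaustive enumeration grows exponentially with the number of attributes, so this form is only suitable for building intuition on small examples.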
Similar resources
Efficient and Effective Analysis of Data Quality using Pattern Tableaux
Data Auditor is a system for analyzing data quality via exploring data semantics. Given a user-supplied constraint, such as a functional dependency or an inclusion dependency, the system computes pattern tableaux, which are concise summaries of subsets of the data that satisfy (or fail) the constraint. The engine of Data Auditor is an efficient algorithm for finding these patterns, which defers...
Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux
We present Data Auditor, a tool for exploring data quality and data semantics. Given a rule or an integrity constraint and a target relation, Data Auditor computes pattern tableaux, which concisely summarize subsets of the relation that (mostly) satisfy or (mostly) fail the constraint. This paper describes 1) the architecture and user interface of Data Auditor, 2) the supported constraints for ...
Investigation of Rural Area and Development Strategies; Case Study of Central District of Semnan County
In general, the studied villages have not developed considerably in their socioeconomic structure compared with their surroundings, owing to their functional area. Nowadays, economic analysis of geographical areas holds an important place in rural development. Considering the applied investigations, upon increasing the knowledge and literacy and scientific and practi...
Analysis of Concurrent Patterns in Statistical Process Control Charts Using Neural Networks
Statistical Process Control (SPC) charts play a major role in quality control systems, and their correct interpretation leads to discovering probable irregularities and errors of the production system. In this regard, various artificial neural networks have been developed to identify mainly singular patterns of SPC charts, while having drawbacks in handling multiple concurrent patterns. In th...
The Concept of Quality in Public Courtyards: Explanations and Analyses. Case Study: Mausoleum of Shah Ni’mat-Allah Vali
Quality is a highly esoteric concept which compels theorists to offer different explanations. Based on library resources, quality can be defined as an interaction between individuals and their environment, which is caused by a set of environmental components differing in each environment. This paper studied the concept of quality in public courtyards. The Mausoleum of Shah Ni’mat-Allah Vali, wh...